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Abstract — In this paper, we have proposed a priority based service time distribution 
method using Erlang Distribution for K-phases. The process of entering into the cloud is 
typically based on the priority of the service and form a queue. Each user needs to wait until 
the current user is being served if priority of the service is same, as per the First Come First 
Served policy. If the server is busy, then the user request has to be waited in the queue until 
the current user receives the result of the prescribed tasks. We have considered a priority 
class refers to a collection of all customers having the same priority. We introduce M/E K /1 
model for a priority based single server and M/E K /2 model for priority based two servers in 
each class in cloud computing scenario to reduce overall mean queue length and waiting 
time. We have considered a single server and two server retrial queueing systems in which 
customers arrive in a Poisson process with arrival rate X. We considered a single server and 
two server queues with three classes of customers. 

Index Terms — Cloud Computing, Erlang Distribution for K-phases, Waiting time, Queue 
length, Priority based service class. 

L Introduction 

Cloud computing is a new cost-efficient computing paradigm in which information and computer resources 
has been accessed from Web browser by users. Queueing systems in which arriving customers who find the 
server busy may retry for service after a period of time is called Retrial queues. In our system there are three 
priority classes, which define the order in which the service will be provided, since scheduling the queue in 
this manner may cause starvation, therefore we have implemented ageing technique to prevent this. If a new 
customer does not find any free server after connecting to the cloud service, then the system automatically 
redirects the request towards a waiting queue. At that moment, if the waiting queue is also fully occupied by 
other customers, then the newly arrived customer has to retry for service after certain time period. This 
technique is known as Retrial queues [1, 2]. Priority of service request is an important issue in scheduling 
because some request should be serviced earlier than others. These are requests which can't stay for a long 
time in a system. The M/E k /n (in this case, n=l or n=2) queuing system with cloud computing service station 
is very useful to provide basic framework for efficient design and analysis of several practical situations 
including various technical systems. In cloud computing environment, multiple servers have been utilized 
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instead of single server to achieve higher computation power. In this manner, the system performance is 
increased reducing the mean queue length and waiting time. In multiple servers [3] scenario, cloud service 
[4] request needs no waiting time for a long time period and also the queue length would be less. 
Our proposed approach using M/EK/1 [5, 6] and M/EK/2 
[6, 7] priority based models has been described in Section 

II. Experimental results using priority based Erlang distribution method has been shown in Section III. 
Conclusion is depicted in Section IV. 

n. Proposed Work 

In this section, a priority based single service channel queuing system cloud model M/E K /1 has been 
proposed. 

Consider a single server retrial queueing system in which customers arrive in a Poisson process with arrival 
rate X. These customers are identified as primary calls. Further assume that negative customers arrive at a rate 
v which follows a Poisson process. Gelenbe [8] has introduced a new class of queueing processes in which 
customers are either Positive or Negative. Positive means a regular customer who is treated in the usual way 
by a server. Negative customers have the effect of deleting some customer in the queue. In the simplest 
version, a negative arrival removes an ordinary positive customer. 

Let k be the number of phases in the service station. Assume that the service time has Erlang-k distribution 
[9] with service rate ku for each phase. We assume that the services in all phases are independent and 
identical and only one customer at a time is in the service mechanism. If the server is free at the time of a 
primary call arrival, the arriving call begins to be served in Phase 1 immediately by the server then 
progresses through the remaining phases and must complete the last phase and leaves the system before the 
next customer enters the first phase. If the server is busy, then the arriving customer goes to orbit and 
becomes a source of repeated calls. This pool of sources of repeated calls may be viewed as a sort of queue. 
Every such source produces a Poisson process of repeated calls with intensity (T. If an incoming repeated call 
finds the server free, it is served in the same manner and leaves the system after service, while the source 
which produced this repeated call disappears. Otherwise, the system state does not change. 
There are 'k' numbers of phases in our cloud system. Basic model of single server is shown in Fig. 1. 
Number of phases is four, if service is not available; where as if service is available, then number of phases 
becomes five. Only one server is considered in this case. We have considered three priority classes. The 
buffer is assumed to be infinite in each class. The priority based queuing discipline is first-come-first-serve 
(FCFS). Users' requests are being served one at a time. A new service with same priority could not be started 
until all the k-phases have been executed. Therefore, each arrival of users' requests increases the number of 
phases by k in the overall system. A mathematical basis for queuing theory [10] is provided for better 
understanding and for higher prediction based on the behavior of communication network. 
We assume that the access from the queue to the service facility follows the exponential distribution which 
may depend on the current number n, (n > 0) the number of customers in the orbit. That is, the probability of 
repeated 

attempt during a particular interval , given that there are n customers in the orbit at time t is na At. It is called 
the classical retrial rate policy. The input flow of primary calls, interval between repetitions and service time 
in phases are mutually independent. 



Arrivals 




Queue Server 



Fig.l. Basic Single Server Queuing Model 

X = Arrival rate of customers 

|J = Expected number of customers completing service per unit time 
Expected number of phases in the system = 
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Ls(k)=— (1) 

2 |i-X 

Expected number of phases in the queue = 

L q(k) = yM=**±*-±- (2 ) 

Expected number of customers in the queue = 

Ls(k) - average number of phases in the service 
Lq = k 

1 k+1 \ k+lA 



k 2 [x-X 2 u J 

_ k-KL X 2 

2k * n(n-A) C "* 

Since, 1/u is the average service time per customer and (k+l)/2 is the average number of phases of one 
customer in service; therefore time taken for serving a customer will be (k+l)/2u. Thus, average numbers of 
phases that arrive in this time would be X(k+l)/2u. 

i 

Average service time per customer = - (4) 

k + 1 

Average number of phases of one customer in service = (5) 
Expected waiting time of a customer in a queue = 

mi Lq k+1 X 

WQ = -r = — *-, — rr (6) 

X 2k (i(n-A) v ' 



Expected waiting time of a customer in the system = 

Ws = Wq + - (7) 

Expected number of customers in the system = L s = L q + - 

k+1 A 2 A 
~ 2k jt()i-A) U 

V 2k iiiii-A) uJ 

= XW S (8) 

Average service time for k — phase = — (9) 

Consider, M/E K /2 queuing system having two servers (server 1 & server 2). Assume that the service time 
follows exponential distributions with the service rate 'ul' for "server 1" and 'u2' for "server 2"; such that 
ul > u2. Customer arrival technique is considered as well-known "Poisson" process with rate X and system 
has the provision for waiting queue. Each service request requires exactly one server for execution of the 
service; and the same priority service is served using typical FCFS technique. When there are several 
requests of same priority customers in the waiting queue, and a server is being released from its previous 
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task(s), then the first available customer request in the queue has been granted for execution. On the other 
hand, if a customer request arrives to an empty queue, it is directed to "server 1" immediately with a 
probability 'p' and the "server 2" with probability "1 - p". In case of two server based Erlang model, we have 
considered four numbers of phases, if service is not available at that moment. If service is available at any 
particular time, then the numbers of phases become six. 

The priority of the requested service entering in the system is first checked. If the priority is 1, then that 
particular service has the highest priority and the process of the service starts. If the priority is 2 or 3 then the 
existence of a customer service with a higher priority is checked. However, in our system, we have provided 
only three different priority classes for convenience. If there exist a service request with a higher priority in 
the queue then that service request (with the lower priority) has to wait till all the higher priority service 
request get the service. The priority of service classes 2 and 3 increases by one after every 60 seconds. This 
helps to avoid starvation to a great extent. After this, the requested service is served as per Erlang' s k-phase 
distribution with two servers. 

The flowchart of priority based different 'k' phases in proposed cloud for two servers has been shown in Fig. 
2. 

Expected number of customers (not phases) in the queue: Lq = 2K ^ 2 jjzj (10) 

It has been observed using equations (3) and (10) that difference between one server based 'L q ' is (u-X), and 
difference between two server based 'Lq' is (4u 2 -X 2 ). Mathematically, (4u 2 -X 2 ) is always greater than (u-X). 
Therefore, value of expected number of customers in the queue for two servers is always less than the value 
of expected number of customers in the queue for single server. Waiting time using two servers based 
queuing method is also decreased. 

Expected number of customers in the system: Ls = Lq + — (11) 

Expected waiting time of a customer in a queue: 



W q = ^ (12) 

i 

Expected waiting time of a customer in the system: Ws = Wq + - (13) 

t av = Average Waiting Time 
Here, we have considered a priority class refers to, a collection of all customers having the same priority. For 
example when the net arrival rate of a customer is say 20, we assume the arrival rates for the first, second and 
third priority classes are respectively. We have assumed that a priority class with lesser value of i has more 
priority and less arrival rate, i.e., the arrival rate of a class varies inversely as the priority of that class. 
W n = Steady state waiting time for priority-n class. 



W »=ir^ (14) 



Bn = l_Skii (16) 



Sfl 



S = Number of servers. 

u = Mean service rate per busy server. 

%i = Mean arrival rate for priority class i. 



A = 

i = l 



2> 
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We have assumed that 1 has been considered as highest priority and we also assume that the arrival rates of 

the priority classes vary exponentially. 

For example, for the ith priority class, the arrival rate 

*i= (xi)', 

Where x^ arrival rate of the 1 st priority class and so on. 

The users' service requests (UR) is provide as input to scheduler server. Then it gets the total available 
resources (AR) and the number of running VMs in the data center in cloud. Find a list of appropriate VMs 
(AVM) capable of provisioning the requested service class. Algorithm 1 described the priority based service 
request handling in each class and Algorithm 2 shows for customer request with priority handling using 
different phases in proposed Cloud. 

In general, in Erlang's k-phase distribution, the expected waiting time for a customer I the system, 

Ws=— *W„ + - (17) 

2k q + M v ' 

Now, for a priority class-i, we have, 

Ws i =^*(l/AB M B i ) + ~ (18) 

In this section some important performance measures along with formulas and their qualitative behaviour for 
various values of X, u,, k , v and a are defining by P( n , 0) = Probability that there are n customers in the orbit 
and server is free 

P( n , i) = Probability that there are n customers in the orbit and server is busy with customer in the ith phase 
The probability mass function of Server state Let S(t) be the random variable which represents the phase in 
which customer is getting service at time t . 

P: ZiJ* P( i,0) where s : 

In this way, 

P: £i=o"° p( i,k) where s : k 

The Mean number of busy servers 

MNBS = lH, B, £ H k p(ij) (19) 

The probability mass function number of customers in the orbit 

Let X(t) be the random variable representing the number of customers in the orbit. 
Prob (No customers in the orbit) = Xj=o k p( 0,j) 

Prob ( i customers in the orbit) = Xj=o k p( i j) 

The Mean number of customers in the orbit 

MNCO = XmT i (Z J= o k P( i,j)) (20) 

The probability that the orbiting customer is blocked 

Blocking Probability = 

2>i"Lj=i k p(iJ) (21) 
The probability that an arriving customer enter into service immediately 
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PSI = p( i,0) 



(22) 



Algorithm 1: 

Priority Based Service Request Handling in each class (PBSRHC) 

Input: User Service Request with priority 
Output: Select a Resource 

Step 1. Form Available VM List and 

UsedVMList and also available ResourceList from each priority class 

Step 2. If AVM (UR,AR) != 4> then 
VM=load-balancer (AVM (UR,AR)) and 
Deploy service on VM and 
deploy=true 

Else if UR has advance reservation 
Deployed=true 

Else if Resource Able To Host Extra VM then 

Start newVMInstance 

Add VMToAvailbaleVMList 

Deploy service on newVM 

Deployed=true 

Else 

queue serviceReuest until queueTime > waitingTime 

Deployed=false 

End if 

Step 3. If deployed then 
return successful 
stop 
Else 

return failure 
stop 

Algorithm 2: 

Algorithm for customer request with priority handling using different phases in proposed Cloud. 

Input: A service request by client in a cloud. 

Output: Requested service allocated to a particular server. 

Step 1: Authentication of users 

Step 2: Check Priority class. If priority ==1 then goto step 3 else if priority ==2 then goto step 4 else goto 
step 5. 

Step 3: If server busy, wait for completion of earlier priority 1 request otherwise allocate request to the server 
and provide service. 

Step 4- (a): Wait for completion of priority 1 customer request or wait for completion of earlier same priority 
request; set timer =1 and timer = timer+1 continues until timer > 60. 
Step 4-(b): If timer > 60 then priority = Priority - 1 and goto step 3. 

Step 5- (a): Wait for completion of priority 1 and priority 2 customer requests or wait for completion of 
earlier same priority request; set timer =1 and timer = timer+1 continues until timer > 60. 
Step 4-(b): If timer > 60 then priority = Priority - 1 and goto step 4(a). 

III. Experimental Results 

We have evaluated the performance of our priority based K-phase Erlang distribution method through 
simulations. With different set of service request simulation is done. In each run of simulation, consider a 
single server retrial queuing priority based system with Erlang-k distribution. Expected waiting time of a 
customer in the system is increased based on the increment in case of arrival rate in single server as referred 
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in Figure 2, Figure 5, Figure 8 and Figure 9 and multiple server scenarios as referred in Figure 3, Figure 4, 
Figure 6 and Figure 7. 

For X=20, we have 

Xi =2.3 1 (for priority class 1) 
X 2 =5.34 (for priority class 2) 
X 3 =12.32 (for priority class 3) 

For X=60 , we have 

X] =3.53 (for priority class 1) 
X 2 =12.46 (for priority class 2) 
X 3 =43.98 (for priority class 3) 

For A=120 , we have 

hi = 4.56 (for priority class 1 - PI) 
X 2 =20.79 (for priority class 2 - P2) 
l 3 =94.81 (for priority class 3 - P3) 

We see that the highest priority class has the lower arrival rate. 
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Fig 2. Wq for M/E k /1 with K=4 
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Fig 3. Wq for M/E k /2 with K=4 
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Fig 4. Ws for M/E k /2 with K=4 
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Fig 5. Wq for M/E k /1 with K=5 
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Fig 6. Wq for M/E k /2 with K=6 
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Fig 7. Ws for M/E k /2 with K=6 
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Fig 8. Ws for M/E k /1 with K=4 
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IV. Conclusion 



In this paper, we have successfully designed a queue based model for priority based Erlang Distribution for 
k- phases. Our design consists of single server with 4 and 5 phases and two servers with 4 and 6 phases. We 
have considered three priority classes. In this model, service requests have been executed using queues. 
Virtual machines [11] are modeled as service centers using M/E K /1 model and M/E K /2 model. Multiple 
servers based scenario has improved the performance based on our system based on queue length reduction 
and waiting time optimization over single server based scenario. Numerical results have been demonstrated 
exhibiting higher performance in case of our priority based approach for M/E K /2 which reduces the queue 
length and waiting time compared to M/E K /1. The low priority of service classes 2 and 3 increases by 1 after 
every 60 seconds. This approach avoids starvation. M/E K /1 has produced better result compared to M/M/l [6, 
12]. M/E K /2 has shown better throughput compared to M/M/2 [6, 12] in terms of average queue length and 
average waiting time. 
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